NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

CHAMMI: A benchmark for channel-adaptive models in microscopy imaging

Chen, Zitong_Sam; Pham, Chau; Wang, Siqi; Doron, Michael; Moshkov, Nikita; Plummer, Bryan; Caicedo, Juan_C (February 2024, Advances in neural information processing systems)

Most neural networks assume that input images have a fixed number of channels (three for RGB images). However, there are many settings where the number of channels may vary, such as microscopy images where the number of channels changes depending on instruments and experimental goals. Yet, there has not been a systemic attempt to create and evaluate neural networks that are invariant to the number and type of channels. As a result, trained models remain specific to individual studies and are hardly reusable for other microscopy settings. In this paper, we present a benchmark for investigating channel-adaptive models in microscopy imaging, which consists of 1) a dataset of varied-channel single-cell images, and 2) a biologically relevant evaluation framework. In addition, we adapted several existing techniques to create channel-adaptive models and compared their performance on this benchmark to fixed-channel, baseline models. We find that channel-adaptive models can generalize better to out-of-domain tasks and can be computationally efficient. We contribute a curated dataset and an evaluation API to facilitate objective comparisons in future research and applications.
more » « less
Full Text Available
CHAMMI: A benchmark for channel-adaptive models in microscopy imaging

Chen, Zitong; Pham, Chau; Wang, Siqi; Doron, Michael; Moshkov, Nikita; Plummer, Bryan A.; Caicedo, Juan C. (December 2023, Thirty-seventh Annual Conference on Neural Information Processing Systems)

Full Text Available
Predicting compound activity from phenotypic profiles and chemical structures

https://doi.org/10.1038/s41467-023-37570-1

Moshkov, Nikita; Becker, Tim; Yang, Kevin; Horvath, Peter; Dancik, Vlado; Wagner, Bridget K.; Clemons, Paul A.; Singh, Shantanu; Carpenter, Anne E.; Caicedo, Juan C. (December 2023, Nature Communications)

Abstract Predicting assay results for compounds virtually using chemical structures and phenotypic profiles has the potential to reduce the time and resources of screens for drug discovery. Here, we evaluate the relative strength of three high-throughput data sources—chemical structures, imaging (Cell Painting), and gene-expression profiles (L1000)—to predict compound bioactivity using a historical collection of 16,170 compounds tested in 270 assays for a total of 585,439 readouts. All three data modalities can predict compound activity for 6–10% of assays, and in combination they predict 21% of assays with high accuracy, which is a 2 to 3 times higher success rate than using a single modality alone. In practice, the accuracy of predictors could be lower and still be useful, increasing the assays that can be predicted from 37% with chemical structures alone up to 64% when combined with phenotypic data. Our study shows that unbiased phenotypic profiling can be leveraged to enhance compound bioactivity prediction to accelerate the early stages of the drug-discovery process.
more » « less
Full Text Available
Learning representations for image-based profiling of perturbations

https://doi.org/10.1038/s41467-024-45999-1

Moshkov, Nikita; Bornholdt, Michael; Benoit, Santiago; Smith, Matthew; McQuin, Claire; Goodman, Allen; Senft, Rebecca A.; Han, Yu; Babadi, Mehrtash; Horvath, Peter; et al (February 2024, Nature Communications)

Abstract Measuring the phenotypic effect of treatments on cells through imaging assays is an efficient and powerful way of studying cell biology, and requires computational methods for transforming images into quantitative data. Here, we present an improved strategy for learning representations of treatment effects from high-throughput imaging, following a causal interpretation. We use weakly supervised learning for modeling associations between images and treatments, and show that it encodes both confounding factors and phenotypic features in the learned representation. To facilitate their separation, we constructed a large training dataset with images from five different studies to maximize experimental diversity, following insights from our causal analysis. Training a model with this dataset successfully improves downstream performance, and produces a reusable convolutional network for image-based profiling, which we call Cell Painting CNN. We evaluated our strategy on three publicly available Cell Painting datasets, and observed that the Cell Painting CNN improves performance in downstream analysis up to 30% with respect to classical features, while also being more computationally efficient.
more » « less

Search for: All records